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Abstract 

We give error bounds which demonstrate optimal rates of convergence in 
the CLT for the total covered volume and the number of isolated shapes, for 
germ-grain models with fixed grain radius over a binomial point process of n 
points in a toroidal spatial region of volume n. The proof is based on Stein's 
method via size-biased couplings. 
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1 Introduction 



Given a collection of n independent uniformly distributed random points in a d- 
dimensional cube of volume n (the so-called binomial point process), let V denote 
the (random) total volume of the union of interpenetrating balls of fixed radius p 
centered at these points, and let S denote the number of balls of radius p/2 (centered 
at the same set of points) which are singletons, i.e. do not overlap any other such 
ball. These variables are fundamental topics of interest in the stochastic geometry 
of coverage processes and random geometric graphs [9, 10, 13, 18]. 

As n — > oo with p fixed (the so-called thermodynamic limit), both V and S are 
known to satisfy a central limit theorem (CLT) [12, 13, 16]. In the present work 
we provide associated Berry-Esseen type results; that is, we show under periodic 
boundary conditons that the cumulative distribution functions converge to that of 
the normal at the same 0(n~ 1 / 2 ) rate as for a sum of n independent identically dis- 
tributed variables, and provide bounds on the quality of the normal approximation 
for finite n. 

Were we to consider instead a Poisson-distributed number of points, i.e. a Poisson 
point process instead of a binomial one, both of our variables of interest could 
be expressed as sums of locally dependent random variables, and thereby Berry- 
Esseen type bounds could be (and have been) obtained by known methods [8, 15, 17]. 
But with a non-random number of points, the local dependence is lost and the 
de-Poissonization arguments in [13, 16] do not provide error bounds for the de- 
Poissonized CLTs. The early work of Moran [11, 12] on V was in response to queries 
in the statistical physics literature (including the well-known paper of Widom and 
Rowlinson [19]) which specifically addressed normal approximation of V for non- 
random n, and in general, it seems worthwhile to study the de-Poissonized setting 
since in practice one might well observe the actual number of points, in which case 
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the conditional distribution of any test statistic, based on what is observed, will be 
over a binomial rather than a Poisson point process. 

The variables V and S are just two of a large class of variables of interest that 
can be expressed as a sum, over the n points, of terms that depend only on the 
configuration of nearby points in some sense. General CLTs have been developed 
for such variables [16, 14] and general Berry-Esseen type results are available in the 
Poissonized setting [8, 15, 17], but it remains open to provide a generally applicable 
Berry-Esseen type result for such sums when n is non-random (see however [3], 
which is discussed further in Section 2). However, there seem to be good prospects 
of adapting the approach of the present paper (which is new in the geometrical 
setting) to a wider class of geometrical sums. 

Our approach to normal approximation is based on Stein's method via size-biased 
couplings. Given a nonnegative random variable Y with positive finite mean fi = 
E7, we say Y' has the Y size biased distribution if P[Y' e dy] = (y/fj,)P\Y G dy], 
or more formally, if 

E |Y(?(y)] = fjMg(Y') for bounded continuous functions /. (1.1) 

The method of size-biased couplings was introduced by Baldi et al. [2], who used 
it to develop bounds of order a -1 / 2 to the normal approximation to the number of 
local maxima Y of a random function on a graph, where a 2 = Var(Y). Goldstein 
and Rinott [7] extended the technique to multivariate normal approximations, and 
improved the rate to cr _1 for the expectation of smooth functions of a vector Y 
recording the number of edges with certain fixed degrees in a random graph. In [6] , 
the method is used to give bounds of order a -1 for various functions on graphs and 
permutations. 

Here we shall use Lemma 3.1 below, which improves the constant in a more 
general result from [6]. Loosely speaking, this result says that given any coupling 



3 



of Y and Y' on a common space, an upper bound on the distance between the 
distribution of Y and the normal can be found which involves functions of the joint 
distribution of Y, Y' in terms of (i) the unform distance between Y and Y', i.e. the 
L°° norm of Y - Y', and (ii) the variance of E [Y - Y'\Y\. 

In Section 4 we show how to find a coupled realization of Y' that is uniformly 
close to Y, for those Y under consideration here. To do this we show that here the 
size-biasing amounts to conditioning the (binomial) number of points falling in a 
certain (randomly located) p-ball to be non-zero, and can be achieved by modifying 
at most a single point location to obtain Y' from Y, so that \\Y' — Y^ is bounded. 
This construction may be of independent interest, along with Lemma 4.1 (a general 
result on how to size-bias a conditional probability) and Lemma 3.1. 

2 Results 

Let d > 1 and n > 4 be integers. Suppose U±, . . . , U n are independent random d- 
vectors, uniformly distributed over the cube C n := [0, n 1 ^)^ (we write Ui rather than 
U n: i because the value of n should be clear from the context). Write U n for the point 
set {Ui, . . . , U n }. For x,y in the cube C n , let D(x,y) denote the distance between 
x and y under the Euclidean toroidal metric on C n . For x G C n and r > let B r (x) 
denote the ball {y E C n : D(x,y) < r}. Let B i>r denote the ball B r (Ui). Given 
r, the collection of balls B i r form a coverage process (also known as a germ-grain 
model) in C n ; see [9, 18]. Let p > 0, and define 



Then V is the total covered volume for the coverage process with r = p, while S 
is the number of singletons (isolated balls) in the case r = p/2, and may also be 
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V:= Volume (U? =1 £ iiP ) ; 



(2.1) 



n 




(2.2) 



viewed as the number of isolated points in the geometric graph on vertex set U n 
with distance parameter p [13]. 

Let Z denote a standard normal random variable. Given a random variable X 
with SD(X) := y / Var(X) e (0, oo), let Dx denote the Kolmogorov distance between 
the distribution of X (scaled and centered) and that of Z, i.e. 

'X -EX 



D x '■= sup 



< t 



- P[Z < t] 



SD(X) 

Our main results provide bounds in the normal approximation for V and S; if p is 
fixed then as n — > oo, 

D v = 0(n- 1 / 2 ); D s = Q(n' 1 / 2 ). (2.3) 

Recall that a n = Q(b n ) means that a n = 0(b n ) and b n = 0(a n ). We conjecture that 
the first bound in (2.3) can be improved to 6(n -1 / 2 ). 

To state our results more precisely, we need further notation. Set to be the 
volume of the unit ball in d dimensions, i.e. Ti d := -n d l 2 /T(l + d/2), and := n d p d . 
We say two unit balls touch if their closures intersect, but their interiors do not. 
Let Kd (respectively, K* d ) denote the maximum number of closed unit balls in d 
dimensions that can be packed so they all intersect (respectively, touch) a closed 
unit ball at the origin, but are disjoint from each other (respectively, have disjoint 
interiors). Then n* d is the so called kissing number in d dimensions, which has been 
studied for centuries (see [5, 20]). It is not hard to see n* d is an upper bound for Ka, 
and in most dimensions it seems likely that Kd = K* d1 but k 2 = 5 whereas k 2 — 6. It 
is known that ^3 = ^3 = 12. Set K d :— 1 + /c<f. 

Set p v := E [V], p s := E [S], a v := SD(V), and a s := SD(S). It is straightfor- 
ward to write down formulae for p v , ps, o~y and a\\ see (7.1), (7.2) and (7.3). 

Our first two main results provide non-asymptotic upper bounds on the Kol- 
morogorov distance. 



Theorem 2.1. If n > Q d (p, then 



W(n,p) := 2^((3" + 1)^ + l) 2 (l + (2" + 1)6-0 + (ff^) 6 2i ( 

+2# ( 3 (4" + 2> + 8(4V ( 2 ;: 3 3 ff ) + 4 + 1) . (2.4) 
Theorem 2.2. 7/n > max(3 d , 2 d+1 + 1)0, t/ien 



Ds < f 7 n (4) 2 + Sy^I^p) + 24 



775(71, p) := 2(1 + 2K d f (l + (2 d + l)3 d + (^f^ ) 9V ) (2.5) 
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2n - (2 d+l + 1)<A 2 4n - 2 



2 V V n ~ ( 2<m + 1)0 / n-1 

By using the inequality (x + y) 2 < 2(x 2 + y 2 ), the bounds in Theorems 2.1 and 
2.2 can replaced by bounds which are simpler, though less sharp. 

The next result confirms that for large n, all of /ly, &v, Ps, an d cr s are 6(n), so 
that (2.3) follows from Theorems 2.1 and 2.2. To provide details we require further 
notation. 

For < r < 2, write u^(r) for the volume of the union of two unit balls in M. d 
with centers distant r apart (see (7.5) for a formula). Define the integral 

J r , d {p) := dn d [ exp(-pViW)^ 1 ^ (2.6) 
Jo 

and the functions 

gv(p) ■= P d J 2 Ap)-(2 d <P + <P 2 )e- 2 *; (2.7) 
g s (p) := e-^-(l + (2 rf -2)0 + 2 )e- 2 ^ + p d (Jv(p)-^ M (p)). (2.8) 



Also, define r)v(p) '■= lim^oo r)v(n, p) and i]s(p) '■= lim„^oo Vs( n , p)- Formulae for 
these limits are immediate from the definitions (2.4) and (2.5). 

Theorem 2.3. If p is fixed then as n — > oo ; 

lim (1 - n- = lim (n"Vs(p)) = e"*; (2.9) 

lim^- 1 ^) = g v (p)>0; (2.10) 

n— >oo 

hm(n-V|) = g s (p)>0, (2.11) 

n— >oo 

and 

hmsup (n^/v) < (J-%j- 2 + 5^ 2 + {2 n) 

n-oo 5^(p) \ \j gv{p) 1/2 9v{p) 1/A J 

<- S(vS^ + ^) ;<2 - i3) 

liminf (n 1/2 L> 5 ) > (8ng s (p)y 1/2 . (2.14) 

Theorems 2.1 and 2.2 are proved in Sections 5 and 6 respectively. Theorem 2.3 is 
proved in Section 7, where we also derive numerical values for the asymptotic upper 
bounds in Theorem 2.3, for some particular cases. 

Remarks. The limiting variances in (2.10), respectively (2.11), are consistent with 
those given by Moran [11, 12], respectively Penrose ([13], Theorem 4.14). Moran 
and Penrose do not explicitly rule out the possibility that these limiting variances 
might be zero, as we do here. 

Clearly (2.12) and (2.13) imply central limit theorems whereby both (V-pv)l^v 
and (S — ps)/ (r s converge in distribution to the standard normal, thereby providing 
an alternative to existing proofs of these central limit theorems [12, 16, 13]. In the 
Poissonized setting, non-asymptotic bounds analogous to those in Theorems 2.1 and 
2.2 are given in [15] and imply 0(n~ l l 2 ) bounds analogous to (2.12) and (2.13). In 
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the de-Poissonized setting considered here, Chatterjee [3] provides bounds similar 
to those in (2.12) and (2.13), which hold for general metric spaces, but using the 
Kantorovich-Wasserstein distance, rather than for the Kolmogorov distance consid- 
ered here, and without providing any explicit constants. As stated in [3], 'obtaining 
optimal rates for the Kolmogorov distance requires extra work and new ideas'. 

Generalizations of our results should be possible in many directions. These 
include: 

More general germ-grain models. Replace the balls of fixed radius in the descrip- 
tion of V and S by (independent identically distributed) balls of random radius, or 
more generally, random shapes. 

Random measures. Consider the random measure associated with V (the Lebesgue 
measure on the covered region) or with S (a sum of Dirac measures at the isolated 
points), and look at normal approximation for the random variable given by the 
integral of a test function / on C n with respect to that measure. 

Euclidean distance. Suppose in the definition of V and S, that the periodic 
boundary conditions on C n are dropped, i.e. the toroidal distance D is replaced by 
the ordinary Euclidean distance. 

Non-uniform points. Consider a sequence of independent random points (X n ) n >i 
with a common density function v : ~R d — > R. Placing balls of radius r n around each 
point of X n : = {X 1? . . . , X n }, for some specified sequence r n tending to zero, one may 
define quantities analogous to V and S. When r n oc n~ x l d this is a re-scaling of our 
model but allows for non-uniform v. Our approach might also provide information 
about other asymptotic regimes. 

k-nearest neighbors. Let k G N and consider the number of points C/j whose kth 
nearest neighbor in the point set U n \ {Ui} lies at a distance greater than p. The 
case k — 1 reduces to S. 

These extensions generally seem to be non-trivial, and worthy of further study. 



8 



3 Lemmas 



The proof of (2.13) and (2.12) is based on the following result. This result improves 
the constant which would be obtained by applying the more general Theorem 1.2 of 
[6] to the particular case of Kolmogorov distance. 

Lemma 3.1. Let Y > be a random variable with mean ji and variance a 2 G 
(0, oo), and let Y s be defined on the same probability space, with the Y -size biased 
distribution. If P[\Y S — Y\ < B] = 1 for some constant B > 0, then 



< 



5a 2 



11B 2 . 2B 

+ 5A + - 

a Jo 



(3.1) 



where A := v /Var(E [Y s -Y\Y]). 

Proof: Given z G K. and e > 0, define real- valued functions h z and h z>£ by 
h z {x) = l ( _ 00iZ] (a;), h g>e (x) = e' 1 / h z (x + t)dt, zGl 
Then with W :— (Y — /i)/a and Z denoting a standard normal, by definition 

D Y = sup{\Eh z (W) -Eh z (Z)\ : z G R}. (3.2) 

For e > 0, set 

D Y := swp{\Eh ZtE (W) -Eh Z)£ (Z)\ :z£l}. (3.3) 
Fix z and e, and let / be the unique bounded solution of the Stein equation 

f(w) - wf(w) = h Zy£ {w) -Eh z>E (Z) 
for h z>£ , see [4]. With some abuse of notation, let IV s = (Y s — /i)/a. Then 

E [h z , £ {W) -Eh z , £ {Z)\ (3.4) 
= E[f(W)-Wf(W)} 



E 
E 



f(W)-±(f(W)-f(W)) 



f'(W)(l-^(W s -W))-^ 

o a j 



w s -w 



(f'(W + t)-f(W))dt 



The following bounds on the solution / can be found in [4] ; 



\f'(w)\ < 1 and (3.5) 

rtVO 

\f'(w + t)-f'(w)\<(\w\ + l)\t\ + e^ / l [z , z+£] (w + u)du. (3.6) 

Jt/\0 

Noting that EF = EY 2 /fi by (1.1) with g(y) = y, we find that 

(7 (J 2 V /X / 

and therefore, taking expectation by conditioning, and then using (3.5), we have 

IE (f'(W)E \l - -(IV s -W)\W]}\< ^VVar(E [W S -W\W]) = -^A. 
I la \ ) a a 2 

Now, using (3.4) and (3.6) yields 



|E [h z , £ (W)-Eh z , £ (Z)]\<-^A + ^E 



B/a 



+ / £ 

'-B/a ■) I 



(W s -W)V0 

(\W\ + l)\t\dt 

(W s -W)A0 

tvo 

l[z,z+e](W + u)dudt 



< 4A + 7— r(E |W| + 1) + V 1 / / (OAs + 2D Y )dudt 

O 2(7 °~ J-B/aJtAO 



A* D 2 , 2// 



< ^A + 1.4^ + --r-B 2 e- v D Y , (3.7) 
where in the second to last inequality above we have used the fact that 



P[a < W < P] < (p - a)/V2n + 2D Y , 

and in the last, the fact that E \W\ < 1. By (3.2) we see that D Y is bounded by 
(3.7), and since (3.2) and (3.3) imply D Y < OAs + D Y , substitution yields 

ae + b 



D Y < 7 (e) :-- 



1-c/e' 



where a := -, 6 := -h-A + --^--B , and c := -^- T - 
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The optimum bound on Dy is at the positive root of to 7'(e) = 0, namely e = c + r 

where r := \J c 2 + cb/a. 

We wish to calculate 7(0 + r). The denominator equals 

c c(c — r) a(c — r) b + a(c — r) 
1 = 1 = 1 H — = -, 

c + r c 2 — r 2 6 b 

and therefore 

, s 1 fa(c + r) + b\ fc + b/a + r\ 
7(G + r) = & Uc-r) + 6 j = H c + 6/a-J 

/ (c + b/a + r) 2 \ _ n C + b/a + r) 2 \ 
\(c + b/a) 2 -r 2 ) ~ V cb/a + (b/a) 2 J 

f (c + b/a + r) 2 \ _ ( (c + b/a + ^c^+b/a) 2 \ 
~ a { c+(b/a) )- a { c+(b/a) ) 

and this bound on D Y yields (3.1). □ 

Let Bin(n,p) denote the binomial distribution with parameters n e N and p e 
(0, 1). Our next two lemmas are concerned with binomial and conditioned binomial 
distributions. Lemma 3.2 is used to prove Lemma 3.3. 

Lemma 3.2. Let m E N and p G (0,1). Suppose N ~ Bm(m,p), and C(N') = 
C(N\N > 0), N" - 1 ~ Bin(m -l,p). Then for all keN, 

P[N >k}< P[N' >k}< P[N" > k]. (3.8) 

Proof. The first inequality in (3.8) is easy since for k > 1, by definition P[N' > 
k] = P[N > k]/P[N > 1]. It remains to prove the second inequality. Suppose 
£i,£ 2 , • • • are independent Bernoulli random variables with parameter p. Let M = 
mm{i : £j = 1} and N" := ^fttP' 1 Then M and TV" are independent and 
C(N") = C(N"). 
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Define the random variables 




i=m(J-l)+l 



In other words, split the sequence of Bernoulli trials into disjoint intervals of length 



interval that contains at least one successful trial. 

Then N' has the distribution of N', and by construction N' < N" almost surely. 
Since N" has the distribution of N" , this shows that N' is stochastically dominated 



Our next lemma demonstrates the existence of a 'uniformly close coupling' of 
random variables with a binomial distribution, and with the same distribution con- 
ditioned to be non-zero (denoted respectively N and M in the lemma). This will 
be used in Section 4 to provide a uniformly close coupling of V (given by (2.2)) and 
its size biased version, and likewise for S (in fact, for n — S). 

Lemma 3.3. Let to G N and p G (0, 1). Suppose N ~ Bin(m,p) 7 with N = Y1T=\ & 
where are independent Bernoulli variables with parameter p. Defining n k by 



we then have < i\k < 1 for all k G {0, . . . , to}. 

Suppose also that B is a further Bernoulli variable with P(B = l|£i, . . . ,£ m ) = 
7Tjv ; and suppose I is an independent discrete uniform random variable over {1, 2, ... , to}. 
Set M := N + (1 — £/)£>, i.e. let M be given by the same sum as N except that if 
B = 1 the Ith term is set to 1. Then 



to, and let N' denote the number of successful Bernoulli trials in the first such 



by N", i.e. the second inequality in (3.8) holds. 



□ 




(3.9) 



C(M) = C(N\N > 0). 



(3.10) 
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Proof. Lemma 3.2 shows ir k > 0. For the upper bound, set N" = 1 + Y^?=2&- 
Then N" - 1 ~ Bin(m - l,p) and N" is equal either to iV or to iV + 1, with 
P[N" = k + 1\N — k] — 1 — (k/m) for < fc < to. Hence for all A;, by Lemma 3.2, 

P[iV > fc] + P[iV = fc](l - k/m) = P[N" >k}> P[N > k\N > 0] 

so ir k < 1. Also, assertion (3.10) follows by (3.9) and the fact that 

{M> k} = {N>k}U{N = k,B = 1,^ = 0}. □ 

Our next result refers to measurable real-valued functions ip defined on all pairs 
(x, X) such that X is a finite subset of C n and x E X. We say that such a functional 
ip is translation-invariant if ip(x, X) = ifj(y + x, y + X) for all x, X and all y G C n 
(here addition is in the torus C n , and y + X := {y + w : w G Af}). For r > 0, we say 
that ip has radius r if ^(x, A") is unaffected by the addition of points to, or removal 
of points from, the point set X at a distance more than r from x, i.e. if for all 
(x, X) we have ip(x, X) = ip(x, X fl B r (x)). The notion of radius is the same as that 
of range of interaction used in [15]; see also the notion of radius of stabilization, in 
[17, 15] and elsewhere. We also define 

II^H := ess swp XjX {\ij)(x, X)\}; 
rng(^) := ess sup x X {ip(x, X)} - ess iai XiX {tp(x, X)}. 

Recall that U n := {Ui, . . . ,U n } denotes a collection of n independent uniformly 
distributed points in C n , and n d is the volume of the unit c?-ball. 

Lemma 3.4. Let n G N and fceN with 2 < k < n. Suppose that for % — 1, . . . , k, 

ipi is a measurable real-valued function defined on all pairs (x, X) with X a finite 
set in C n and x G X . Suppose for each i that ipi is translation-invariant and has 
radius r\ for some r\ G (0, oo), and that < oo, and E [0i(^i, U n )] = 0. With 
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(pi := n^rf, suppose also that <f>2 + • • • + (fik < n. Then 



E 



i=l 



< 



i=2 



X i K„ | ^Jr, / , 



i=2 'ri 



J=2 / \«-£i=2& 

Proof. Given x = (xi, . . . , x k ) G C*, define the set of points 



:= {x 1 , . . . , x k , U k+ i, • • • , C/ n }. 



Let F n be the set of x = (x 1: . . . , x k ) G such that D(x 1 ,x i ) > r\ + r-j for 
« G {2, . . . , fc}, and let := \ F n . Then by the law of total probability, 



E 



i=i 



p k n k 

n- k EfI^„W> + ^ Ej]^(x,,^)rfx. 

i= i i=i 



Since E [^1(^1, W„)] = it follows that 1 1 -0x 1 1 < rng(V'i), so that 



n 



i=l 



< 



niwijp[(^i,...,^)ei^ 

/ k \ k 

< rng(^) (nil^ll ^^(n +^7^(3.11) 



vi=2 



i=2 



Fix x = (#1, . . . ,x fc ) G F n . For m G Z + , let hi(m) := E'0 1 (xi, {a^} U ^ m ), where 
y m denotes a collection of m uniformly distributed points in B ri (xi). Let h2(m) := 
E Yli=2 i^iixii { x 2, ■ ■ ■ i x k} U y' m )i wnere y' m denotes a collection of m uniformly 
distributed points in L)f =2 B ri (xi). 

If Ni and N 2 denote the number of points of {U k +i, ■ ■ ■ , U n } in B ri (xi) and 
in \Ji =2 B r .(xi), respectively, then the values of ipi(xi,U^) and of Y\i=2 ^ii^Mn) 
are conditionally independent, given (Ni,N 2 ), because the regions B ri (xi) and 
Ui =2 B ri (xi) are disjoint since we assume x G F n . Hence, we assert that 



E 



E [h 1 (N 1 )h 2 (N2)\ 



(3.12) 
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where (Ai, A 2 , A 3 ) have the multinomial distribution 

(N u A 2 , A 3 ) ~ Mult (n - k; — , — , — ) (3.13) 
V n n n / 

with ai denoting the volume of a ball of radius r\ in C n (so that ai < (f>i — itd^t 
with equality if r 1 < (l/2)n l / d ) while a 2 is the volume of U\ =2 B ri (xi) in C n and 
03 := n — ai — a 2 . To verify (3.12), use the law of total probability to decompose 
the left hand side as a sum over possible values of (Ai, A 2 ). 

Also, if Ni ~ Bin(n — 1, ^-) then E [/i 1 (A^ 1 )] = 0, because of the assumption that 
E [ipi(Ui, U n )] = 0, along with translation invariance; the value of K[hi(Ni)] does 
not depend on x±. 

We give a coupling of Ni to another random variable N[ with the same distri- 
bution as Ni that is independent of N 2 , for which we can give a useful bound on 
P[N^N[\. 

Consider throwing a series of colored balls so each ball can land in one of three 
urns, where the probability of landing in Urn % is a^/n for 1 < i < 3. First, throw 
n — k white balls and let N^,N 2 ,N^ denote the number of white balls in Urn % 
for i = 1,2,3 respectively, that is, let (A^*, N 2 , A3*) have the Mult(ra - k; f, f, f) 
distribution. Now pick out the n — k — N 2 balls in Urns 1 and 3, paint them red, 
and throw them again; that is, given the values of A*, A 2 , A3 let A]", A£, A3 count 
the number of red balls in Urns 1, 2, 3, respectively, and so be nonnegative integer 
valued variables such that 

£ ((NI, N;, N;) I A;, A 2 ) = Mult (n - k - A 2 ; ^, ^, ^) . 

Now take the A£ red balls in Urn 2, paint them blue, and throw them again but 
condition them to land in Urns 1 and 3 (or equivalently, throw each blue ball again 
and again until it avoids Urn 2), so that 

C ((Aj\ A 3 b ) I A*, A 2 , A[, N r 2 ) = Mult ( N r 2 ; — ^— , -^—) ■ 

\ ai + a 3 a 1 + a 3 J 
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Finally, throw k — 1 + N 2 green balls, making the total number of green, red and 
blue balls n — 1, and record how many land in Urn 1, so 

C {Nf\N*, N 2 , Nl, N r 2 , Afl = Bin (k - 1 + N 2 ; ^ 

\ 71/ 



Now set 



Ni = Nl + N* and N 3 = N; + N%, and N[ = N{ + Nf. 

Then (Ni,N 2 ,N 3 ) have the multinomial distribution given by (3.13). Also, N[ 
Bin(n — 1, — ) and N[ is independent of A^. 
Since JV( = JVi - iVj + JVf , we have that 

ai 



P[Ni ^N[] < E [Nf] + E [iV*] < ^ (A; — 1 + E N 2 ) + ^ 

< ^ (fc - 1 + 02) + f-^-) 02 
n \n — a 2 J 



ai + a 3 



so that 



|E [h 2 (N 2 )(hi(Ni) - hi(N[))}\ < °± [k - 1 + a 2 + (^^)) mg(^i) J] \M 
and since Aj is independent of N 2 with A] ~ Bin(n — 1, ^), 

so by (3.12) and the fact that a\ < <pi and a 2 < Ei= 2 4>% an d the assumption that 



it 



E 



1=1 



ai / / 2n - a 2 

< — A; - 1 + a 2 I i ni 

n V \ n-a 2 



g(^) n 



i=2 



2 r! 



rng(^i) JJ \\A 



The preceding bound holds uniformly over all possible values of x = (x\, . . . , Xk) G 
F n . Combined with (3.11), this shows that the asserted bound holds. □ 
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Lemma 3.5. Suppose tpi is as defined in Lemma 3.4- Then with notation from that 
result, if (pi < n then 



Var 



1 " 

n 

i=l 



< 



n 



(1 + 2%) + 



ll^i 



n 



>1 + 0I 



2n — 0x 



n 



Proof. By the case k — 2 of Lemma 3.4, 



rngOi). 



< 



n 



2 d 1 ||Vi||+0i l + 0i 



2n 



n - 0i 



rng(-0i) 



and since 



Var 



i=i 

the result follows. 



n — 1 

n 



□ 



4 Size-biased coupling constructions 

We now give a simple lemma which shows how to size-bias a random variable that 
can be expressed as a conditional probability of an event arising from some further 
randomization. 

Lemma 4.1. Suppose Y is a random variable given by Y = aP[A|jF] ; where T is 
some o -algebra, a > is a constant, and A is an event with < P[A] < 1. Then Y' 
has the Y size biased distribution if 

C(Y') = C{Y\A) (4.1) 

Proof. With C(Y') defined by (4.1), we must show for all bounded and continuous 
g : R -> R, that E [<?(Y')] = E [Y#(Y)]/E [Y] (see (1.1)). But 

E[g(Y')]=E[g(Y)\A] = E[g(Y)l A ]/ P[A] 

= E[g{Y)P[A\^\\/P[A\ 
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where the last equality follows because g(Y) is T- measurable. The last expression 
equals E [Fp(F)]/E [Y], as required. □ 



Let V, S be given by (2.1), (2.2). Set W = n — S (the number of non-singletons). 
We assert that either V or W can be expressed as n times the conditional probability 
of some event A, given the locations of the points of U n , so that Lemma 4.1 is 
applicable. For V, take A = A v to be the event that an additional uniformly 
distributed random point Uq in C n lies in the covered region U™ =1 -Bj iP . For W, take 
A = Aw to be the event that an element of U n , selected uniformly at random, is 
non-isolated. 

Event Ay can be written as the event that Ny > 0, where Ny denotes the 
number of points of U n in B p (Uq), and Ny ~ Bin(n, <f)/n) (recall <fi := 7idP d and C n 
has volume n.) A point set (denoted Uy) with the conditional distribution of U n 
given N v can be obtained as follows: 

I. Sample a uniform random point in C n , denoted Uq. 

II. Set m = n. Sample iV = Ny independent uniform random points in B p (Uq), 
and m — N independent uniform random points in C n \ B p {Uq). 

III. Let Uy be the union of the two samples of uniform points. 

Therefore, coupled realizations of Uy and U v (having respectively the distribution 
of U n and the conditional distribution of U n given Ny > 0), and hence coupled 
realizations of V and V, can be obtained as follows. 

1. Set m = n. 

2. Sample Uq uniformly at random over C n . 
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3. Sample m random d- vectors independently and uniformly over C n , and denote 
this point set by U m ,i- 

4. Let iV denote the number of points of U m> \ in B p (U ). 

5. Sample a Bernoulli random variable B with P[B = 1] = n N , where k > 0) 
is given by (3.9). 

6. Sample a random d- vector U which is uniform over B p (U ). 

7. If B — 1, then select one of the points of U m ,\ uniformly at random, and move 
it to U . Denote the resulting modification of U mt \ by U m ^- If B — then set 
Um t 2 '■ = U m ,i- 

8. Set Uy := U m ,i and W v := U m ^- Set V := g v (U v ) and V := g v (Uy), where 
g v {U) := Vo\(U xeU B p (x)). 

By Lemma 3.3, the number of points of U m ^ in B p {Uq) has the distribution C(Nv\Ny > 
0), and hence C(U V ) = C(U V \N V > 0). So by Lemma 4.1, V' has the V size biased 
distribution. 

In the case of W, Aw is the event that Nw > 0, where Nw denotes the number 
of points of U n \{Uo) in B p (Uo), and now Uq denotes a point of U n selected uniformly 
at random. So N w ~ Bin(n — 1, (f>/n). We can obtain a point set (denoted Uw) with 
the conditional distribution of U n given A^vk by the same steps as for Uy except that 
now in Step II we put m = n — 1 and iV = Nw, and in Step III, Uw is the union 
of the two samples of uniform random points with an added point at Uq. Hence, 
we can obtain coupled realizations of W and W by the same sequence of steps as 
described above for (V, V), except that the following steps are modified: 

• In Step 1, set m — n — 1 (this affects Steps 3 and 5.) 
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• In Step 8, set U w := U mA U {U }, and U' w := U m<2 U {U }. Set W := g w (U w ) 
and W := g w {W w ) with g w {U) := £^ W H ^ {x}}. 

By a similar argument to the V case, W has the VF size biased distribution. 

5 Proof of Theorem 2.1 

We couple V to V as described in Section 4. Since V differs from V through the 
moving of at most a single point, clearly \V — V\ < n d p d := 0. Hence, by Lemma 
3.1 with B = (f), to prove Theorem 2.1 it suffices to prove the following. 

Proposition 5.1. Under the assumptions of Theorem 2.1, Var(E [V — V\V\) < 
n~ l rjy{n, p) , where r] V (n,p) is given by (2.4). 

Proof. Let Q be the a-algebra generated by the point set Uy. Information about 
which points of Uy came from where is not included in Q. List the points of Uy, 
in an order chosen uniformly at random, as Ui, . . . , U n , and set U := (Ui, . . . ,U n ). 
Then V is ^-measurable. The conditional variance formula, with X = E [V — V\Q], 
yields 



For x G C n , let £ x denote the probability that B — 1, given U n and given that 
Uq = x, i.e. £ x = 7Tn x , where N x denotes the number of points of Uy in B p {x). Let 
R X j denote the expectation (over U) of the increment in the covered volume if Uj is 
moved to a uniform randomly selected location U in B p {x). Note that for x and j 
fixed, R X j is determined by U. Then, since both U and / are independent of Q, 



Var(E [V - V\V\) = Var(E [X\V]) < Var(X), 



so it suffices to prove 



Var(E [V - V\g\) < n^rjvin, p). 



(5.1) 
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where the first factor of 1/n comes from the probability density of Uo, and the second 
arises as the probability that / takes the value j. 

Let H x be the expectation (over U) of the increment in the covered volume when 
a point is inserted into Uy at a uniform random location U G B p (x), and let Tj be 
the increment in the covered volume when point Uj is removed from U (for fixed 
x and j, both H x and Tj are determined by U). If Uj is far distant from x then 
R X j = H x + Tj. Set Q X j := R X j — H x — Tj, which is in fact the expectation (over U) 
of the total volume of the otherwise uncovered regions lying within distance p both 
of U and of Uj (such regions contribute to Tj but not to H x or R X j). Then 



M[v'-v\g] = 




& \H x + -J2Q X j) dx+\ I J^CxTjdx. (5.2) 

Set := vr d p d . We have that < H x < (f), > Tj > -(f) and if D(x, Uj) > 3p 
then Q X j = 0. Moreover, if D(x, Uj) > 3p for all j G {1, . . . , n}, then H x = <p and if 
D(x, Uj) > p for all j G {1, . . . , n}, then £ x = 1. Finally, Q X j > and 

j n n 

< ff a + -^Qxj < H x + ^2Q xj < 0- 

Hence setting 

/ 1 n \ 

v n U J 

we have that —0 < < 0, and r x is determined by the collection of points of U n 
within distance 3p of x, and t x = if there are no such points of U n . We can rewrite 
(5.2) as 
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Recalling that B r (x) := {y G C n : D(x,y) < r}, let T iyT be the set of points 
y G B r (Ui) such that D(y,Ui) < D(y,U j ) for all j G {l,...,n}\ {%} (i.e., the 
intersection of the r-ball around Ui and the Voronoi cell of Ui relative to U n ). Set 

S[ ■= / r x dx, S" := / (£ x - l)cfcc. 

e [V - v\g] = * + £ s<) + ^ ij) + £ s-r, 

( ^ E S " T A ■ < 5 3 > 



Then 



+ 



and if we put b = ETj (which does not depend on i), we have 



n 2 E S " T i n 2 

\( 



so by (5.3), 



1 " 

+- + T * + ( n ~ lT * + (! - »"»<S7)- 



Since (x + 7/) < 2(x + y ) for any real x, y, 



( 1 " 

Var(E [V - V\Q\) < 2Var - ^(^ + T t + (n" 1 ^ + (1 - n" 1 )^' 

\ n i=i 

+ 2Var (^( E ^--6)] ) 



(5.4) 



We have the following table of radii and bounds on possible values (the last two 
columns are deduced from the previous columns): 

22 



Variable 


T x 


T 


s'l 


S'i 


(n^T 




radius 


3p 


2p 


2p 


6p 




2p 


lower bound 


-<f> 


-<t> 


-<t> 









upper bound 
















2 



Hence, the variable 

Si + Ti + (n^T + (1 - n" 1 )^^' 

has radius 6p relative to C/j and lies between — — 3 d 2 and 2 , so that its centered 
value is bounded in absolute value by (3 d + 1)0 2 + 0, and this also bounds its range 
of possible values. So by Lemma 3.5 and the assumption that 6 d < n, 



Var U £ (a,' + T, + (n" 1 ^ + (1 - n" 1 )^') j 



< ^ ((3d + + 1)2 (l + (2* + 1)6V + ) 6 V ) ■ (5.5) 



Now consider the last term in the right hand side of (5.4). Set Tj := Tj — b. Then 

Var ( J2 S iTj =n(n- l)(n - 2)(ra - 3)Cav(SfT 2 , S3T4) 

+n(n - \){n - 2) (Cav(S'{f 2 , S'{f 3 ) + Cov(^7\, ^T) + 2Cov(SfT 2 , ^T)) 

+n(n - 1) (Var(^'T 2 ) + Cov(S'{f 2 , S' 2 %)) . (5.6) 

It follows from the case k — 4 of Lemma 3.4 and the assumption 6 d < n (which 
implies 3{2 d <p) < n) that 

Cov(S'{T 2 , S^T 4 ) = E [S'(f 2 S'^T 4 \ - (E [S'{f 2 ]) 2 < E [S'{T 2 S^T 4 ] 

^ (ui , o^. . ,^ 2 /2n-3(2 rf )0\ 



23 



Since we can always bound Gov (S"Tj, S"Ty) above by 4 , we have from (5.6) that 

< £( 3 (4< + 2^ + 3(4V(^|^) + 4 + !). (57, 
By (5.4), (5.5) and (5.7) we have that 

(n/2)Var (E [V - V\Q\) 
< 2 ((3 d + 1)0 + I) 2 (l + {2 d + 1)6^0 + (^£^) 6 2 V) 

^(3(4' + ^ + 3(4V(^g) + 4 + ?). 
This completes the proof of Proposition 6.1, and hence of Theorem 2.1. □ 



6 Proof of Theorem 2.2 

We couple W' to W as described in Section 4. Thus = gw(Uw) and W = 
gw(U w ), where U' w is obtained from Uw by moving at most a single randomly 
selected point of Uw \ {Uo} to a (uniform random) location in B p (Uo), if B — 1, and 
leaving unchanged if £> = 0. 

The number of points that can be made isolated by removing a single point from 
Uw is almost surely bounded by n d . Moreover, the number of points that can be 
made non-isolated by inserting a point (including the inserted point itself) is almost 
surely bounded by Kd + 1. Hence \W — W'\ < Kd + 1, so we may take B — Kd + 1- 

By the symmetry of the normal distribution, D_s — D$ and hence Dw = D$- 
Thus, Theorem 2.2 follows from Lemma 3.1 along with the following: 

Proposition 6.1. Under the assumptions of Theorem 2.2, Var(E [W — < 
n^rjsin, p), where r)s(n, p) is given by (2.5). 
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Proof. Here we let Q denote the a-algebra generated by the unlabelled point 
set U := Uw, containing no information about which points of U come from where. 
Then W' is ^-measurable, and by the conditional variance formula (as in the proof 
of Proposition 5.1), it suffices to prove that 



Label the points of U, in an order chosen uniformly at random, as U\, . . . , U n , 
and set U := (Ui, . . . , U n ). £j = n Ni , where iVj denotes the number of points of 
U \ {Ui} in B p (Ui). Let Rij denote the expectation (over U) of the increment in 
the number of non-isolated points when Uj is moved to a uniform randomly selected 
location U in B p (Ui). Then 



where Ylujyi^j denotes summation over pairs of distinct integers i,j in [l,n]. 

Now let Hi be the expectation (over U) of the increment in the number of isolated 



and let Tj be the increment in the number of isolated points when point Uj is 
removed from U (both Hi and Tj are determined by U). If Uj is far distant from 
Ui then = —Hi — Tj. In fact, setting Qij := + Hi + Tj, we have that Qij 
is the expectation (over U) of the number of otherwise isolated points of U within 
distance p both of U and of Uj (such points contribute to Tj but not to Hi or R^). 



Var(E [W -W\G\)< n^rjsin, p). 



(6.1) 




points when a point is inserted into U at a uniform random location U G B p (Ui), 



Then 



e[W- w\g\ 



1 



U-Hi-Tj + Qij) 



n(n — 1) 




(6.2) 
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where we set 

n := -Hi + 



l — E Qa- ( 6 - 3 ) 



n . 



Put a : = E (given n, this expectation does not depend on i) and put 6 := 
(««, - l)/2. Then 



1 1 



-7 TV E W = "7 TT E 



Hence we can rewrite (6.2) as 

n 

E - = - E " «^ " Kb ~ a)) " 7 _ n E & " " 6 )' 

Since (x + y) 2 < 2(x 2 + y 2 ) for any real x,y, it follows that 

Var(E\W'-W\g]) < 2Vm (^^(^(n - b) + a(b - T,))^ 



+2Var 



l^h) E «.-<om-»Y (6.4) 



We have that — n d < Hi < 0, — 1 < Tj < n d) and Qij > 0. Also, 

< -Hi + E Qij < K ^ 

and if D(Ui,Uj) > 3p then = 0. Hence, < Tj < k^, and r, is determined by 
the collection of points of U within distance 3p of C/j. To summarize, if the radius 
of a variable indexed by i is the smallest distance from C/j one needs to look to 
establish its value (as with the functionals considered in Lemma 3.4), we have the 
following (where the last three columns are deduced from the preceding columns 
and the definitions of a, b): 
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Variable 


Hi 


Ti 


ii 


n 


Un - b) 


a(b-Ti) 


to-6) + a(6-T i ) 


radius 


3p 


2p 


p 


3p 


3p 


2p 


3p 


ess inf 


-K d 


-1 








(1 " K d )/2 


-(« d + l)/2 


-«d 


ess sup 







1 




(«„+l)/2 


(« d + l)/2 


re d + 1 



From the last column in this table, we see that after centering, the terms in 
first sum in the right hand side of (6.4) have radius 3p and absolute values bounded 
by 1 + 2n d . Moreover, even after centering each of these terms has range (i.e. 
essential supremum minus essential infimum) which is also bounded by 1 + 2nd (this 
range is unaffected by the centering). Hence with <fi := 7idp d , Lemma 3.5, using the 
assumption 3 d (f> < n, yields 

Var (^fL(U T i-b)+a(b-T^ 



< 



Now consider the second sum in the right hand side of (6.4). Set & := £j — a and 



Tj :=Tj- b. Then 



Var J2 & " a )( T i -b)\ =n(n- \){n - 2){n - ^Covpj,^) 

+n ( n - l)(n - 2) (Cov(6f 2 , fiT 3 ) + CovfeTi, f 3 7\) + 2Cov(af 2 , f 3 7\)) 

+n(n - 1) (Var(6f 2 ) + Cov(6T 2 , £ 2 Ti)) . (6.6) 

Note that ^ has absolute value bounded by 1, and range of possible values also 
bounded by 1, and mean zero. Also, Tj has absolute value almost surely bounded 
by («d + l)/2 (its mean might not be zero). Hence, the case k — 4 of Lemma 3.4 
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(taking r\ = r 2 = p and r 3 = r 4 = 2p so that 2 + 03 + 04 = (2 d+1 + 1)0) yields 
Cov(6T 2 , £ 3 T 4 ) = E [6T 2 £ 3 f 4 ] - (E [6T 2 ]) 2 < E 

s !si!E(^ +! , + , +r + W (tgi|)) 

where we have also used the assumption that (2 d+1 + 1)0 < n. Since we can always 
bound Cov(&Tj,&/7>) by ((/c d + l)/2) 2 , we have from (6.6) that 



« d + l\ 2 /4 2 



By (6.4), (6.5) and (6.7), we have that 
nVar (E [W - W\Q\) 

4- 1 4- I 

n - 3 d 



< 2(1 + 2/t,) 2 ( 1 + (2 d + l)3 d + ( 2n 3 .J ) 9^ 



+ <*±±)V 4 + 2 



(2(3-) + 2^ + 3)0 + (2^ + 1) ( ^jff 



2 V 

This completes the proof of Proposition 6.1, and hence of Theorem 2.2. □ 

7 Proof of Theorem 2.3, and numerics 

Again set := vr rf p d . It is easy to see that provided 2p < n 1 ^, 

E[V] =n(l-(l-0/n) n ); E [£] = n(l — 0/n) ra_1 , (7.1) 
and (2.9) follows from this. 

28 



Write | • | for the Euclidean norm and recall that denotes the volume of 

the union of unit balls centered at the origin and at x. If I x denotes the indicator 
of the event that x is not contained in any of the balls B p>i , then provided 4p < n}l d 
we have the exact formula 

Var(F) = Var(n - V) = Var j I x dx (7.2) 

= ®[IJy]dxdy - (n(l - 4>/n) 

J Cn J C n 



n\2 

kl, \± x ±y\ujbuy — V'H 1 — H J l' ,b ) 

C n J Cn 

, ._ ^a^uj^ . dy + _ 2d(j)) r _ 2f\ U _ _ ^ )2 „ 



B 2p (0) 



n \ n 



Proof of (2.10). For asymptotics as n — > oo with p fixed, use the MacLaurin expan- 
sion of log(l — x) to obtain 

n J \ n 

2n / ^2 



so that 

/• / n d ,,,J\i,\ln\\ n 

dy 



n -^ [v) = f u. ^mm 

Jb 2p (o) \ n J 

+ne -20 - exp(-^ ) + 0(n~ 2 )) 

\ n n n J 

eM-p d M\y\/p))dy ) - ^{2^ + <f) 



'^(0) 

and this limit is equal to gv(p) as defined by (2.7), so the first part of (2.10) is 
proven. 

It remains to show that gv(p) > 0. This can be done either by using the last 
part of Theorem 2.1 of [16]), or directly. We leave it to the reader to check that the 
conditions of the last part of Theorem 2.1 are satisfied here, or to look up the direct 
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argument which is in the first version of this paper (arXiv:0812.3084). Thus (2.10) 
holds in its entirety. □ 



The computations for S are somewhat similar. With Xi denoting the indicator 
of the event that Ui is isolated, 

Var(S) = nVar(Xi)+n(n-l)Cov(Xi,X 2 ) 

= n(l - <j>/n) n -\\ - (1 - <p/n) n - 1 ) + n(n - l)Cav(X 1 ,X 2 ). 

Since Cav(X 1 ,X 2 ) = E [X^] - E [Xi] 2 , provided 4p < n 1 ^ we can write 



Var(S) = ra(l - <f>/n) n -\\ - (1 - 

'B 2p (0)\B p (0) ' 



J b 2p (o)\b p (o) V n J 



Proof of (2.11). For asymptotics as n — > oo with p fixed, by again using the 
MacLaurin expansion of log(l — x) we obtain 

1 - - j " " = exp((n -■>)(■ - 0(n- 3 )) 



and 



n l \ n n 

expf-20 + 40 - 202 + O(n- 2 ) 



I--) 2 " 2 = expf(2n-2)(-^-^ + 0(n- 3 )) 



1:2 



exp ( -20 + — — — + 0(n- 2 ) 
n 
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and hence the last term in the right hand side of (7.3) is equal to 



n(n — 1) exp(— 20) 

x 1 exp( ) - exp( ) + U(n t 



so that 



n J n n 

= n(n - 1 exp -20 + — - — + O n" 2 , 

\ n n n 



lim n- 1 Var(5) 

e -*(l _ e -*) _ e ^((2 d - 2)0 + 2 ) + / e-^^l^dy 

J B 2p (0)\B p (0) 

- (1 + (2 d - 2)0 + 2 )e- 2< ^ + p d / e-^ (|u|) rf M , 

Jbo(0)\Bi (0) 



— e 

'B 2 (0)\Bi(0) 

and since this limit is equal to gs{p) as defined by (2.8), we have proved the first 
part of (2.11), namely, convergence to gs{p)- 

To complete the proof of (2.11), we need to show that gs(p) > 0. This can be 
done by the same arguments for proof of (2.10). Hence, (2.11) holds in its entirety. □ 



Proof of Theorem 2.3. It remains only to prove (2.12), (2.13), and (2.14). By 
definition rjv(p) = lim^,^ r)v(n, p) and77s(p) = lim^oo 775(71, p). Then (2.12) follows 
at once from Theorem 2.1, along with (2.9) and (2.10). Similarly, (2.13) follows at 
once from Theorem 2.2 along with (2.9), and (2.11). 

Finally, we demonstrate the asymptotic lower bound (2.14). For any random 
variable X, let F x denote its cumulative distribution function and let f x denote its 
probability density function (if it has one). Let e G (0, 1). Set 

. _ [ps] ~ Ps . _ [ps] ~ Ps + 1 ~ £ 
ii .— , r 2 •— • 

Here [•] denotes integer part, so that \U\ < a^ 1 for i — 1,2. By the unimodality of 
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the standard normal density, 



F z (t 2 ) ~ Fzih) > fa-tjmmifzitjjzih)) 
> {l-e)a- s 1 f z {a- s 1 ). 



(7.4) 



On the other hand, since S is integer- valued, F(s-n S )/ as (ti) is equal to F^s-^/asi^), 
so that by (7.4) 

D s > (1/2)(1 -s)a s 1 f z (a s 1 ). 
Scaling by n 1//2 , letting n — > oo, using (2.11) and letting e — > yields (2.14). □ 

To conclude, we compute some numerical values for the asymptotic upper bounds 
appearing in (2.12) and (2.13). For this we need to compute J r ,d(p) defined by (2.6) 
(for r = 1 and r = 2), and for this in turn, we need to compute Wd{u), the volume of 
the union of two unit balls in <i-space whose centers are at points (x, x' say) distance 
u apart (u < 2). Clearly, uj\{u) — 2 + u, and generalizing (6) of [11] to arbitrary 
d > 2, we have 



Wd{V>) = 7T d + 7T d _! / (1 - (t/2) 2 )^ 2 dt, d>2. 
10 



(7.5) 



Using the preceding formulae, we have computed numerical values for the asymptotic 
upper bounds in Theorem 2.3, for the cases with p — 1 and d < 3. These are as 
follows to five significant figures, where <V(p) denotes the right hand side of (2.12) 
and 5s(p) denotes the right hand side of (2.13). 



Sv(D 



6.4252 x 10 3 if d=l 
8.6212 x 10 5 if d = 2 
1.4451 x 10 8 if d = 3 



S,(l) 



2.1024 x 10 3 if d=l 
4.6833 x 10 4 if d = 2 
1.0578 x 10 6 if d = 3 
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